Abstract

Presented in this paper is a detailed and direct comparison of the LIGO and Virgo binary neutron 
star detection pipelines. In order to test the search programs, numerous inspiral signals were added 
to 24 hours of simulated detector data. The efficiencies of the different pipelines were tested, 
and found to be comparable. Parameter estimation routines were also tested. We demonstrate
that LIGO and Virgo would clearly benefit from conducting a joint coincident analysis;
the advantages include increased detection efficiency and information on the source sky
location.

I. INTRODUCTION

The Laser Interferometer Gravitational Wave Observatory (LIGO) [1] detectors have
reached their design sensitivity, while Virgo is quickly approaching its target sensitivity.
This achievement will ultimately be rewarded through the observation of gravitational waves.
Numerous potential sources exist, producing signals of differing character. The data analysis
efforts of LIGO and Virgo aim to detect and identify these signals. The inspiral of
binary compact objects, such as neutron stars or black holes, is one of the most promising
sources of gravitational waves. The observation of the coalescence of binary compact objects
will expand our knowledge of the astrophysics of compact objects and provide unique tests
of general relativity and cosmology. LIGO has already conducted searches for binary
neutron star inspiral signals, and has placed upper limits on source distributions.

LIGO and Virgo have each developed methods (analysis pipeline software) for finding
binary inspiral signals. In order to maximize the probability of detecting gravitational
waves it is likely that LIGO and Virgo will collaborate, and a necessary first step
toward such a working relationship is the validation and understanding of each group's
detection strategy. In this paper we present the results of a comprehensive study in which
the LIGO and Virgo inspiral search pipelines are compared side by side. We have also
conducted a similar LIGO-Virgo comparison study for a gravitational wave burst search [?].
The results presented in this paper show that the LIGO and Virgo binary inspiral detection
pipelines operate comparably well. In addition, we show that working together brings
clear benefits in the quest to detect gravitational waves from binary inspirals, and
that more astrophysical information can be extracted from the signals. For example, the
results summarized in this paper demonstrate that a detection strategy based on two-detector
coincidence is improved considerably with Virgo included, as opposed to just using LIGO
data from the Livingston and Hanford observatories.

In order to conduct a study that compares the capability for inspiral detection it was
necessary for both groups to ensure that they were looking for the same signals. The LIGO and
Virgo groups demonstrated to one another that they were, in fact, looking for exactly the same
binary inspiral signal. This was a non-trivial first step, as the inspiral signal is
complicated and known only approximately. Only after this signal
generation validation study was complete were we able to mutually verify our detection
pipelines. The results of this signal validation effort are presented below.

A previous LIGO-Virgo study showed that both groups were equally competent in finding
inspiral signals from optimally oriented sources [7]. In the present study the detection
validation is accomplished using simulated data containing realistic source orientations.
Specifically, the binary inspiral sources are simulated to come from the galaxies M87 and NGC
6744. Due to the effects of the Earth's rotation, and the random orientation of the binary
orbital plane, the (simulated) signals impinging on the (simulated) detectors were non-optimal.
The present study also incorporates the time delays that will be present in the response of a
network of detectors to gravitational waves from a particular sky direction. In this way
we aimed to make the response of the network of LIGO and Virgo
detectors as realistic as possible. For this study we created simulated noise, with the
noise spectral density matching the target expectations for LIGO and Virgo; the simulated
noise is Gaussian distributed.

The paper is organized as follows. In Section II we review the results of our previous
analysis, where the LIGO and Virgo inspiral pipelines were compared using simple optimally
oriented signals. In Section III we discuss the comparison of the generation of simulated
binary inspiral signals, and ensure that LIGO and Virgo are searching for signals of exactly the
same form. In Section IV all of the LIGO and Virgo inspiral search pipelines are discussed,
and their responses to the data in the present study are presented; the two Virgo pipelines
are presented in Section IV A and Section IV B, and the LIGO pipeline is described in
Section IV C. The results from a LIGO-developed Bayesian parameter estimation technique
for detected signals are given in Section V. A side by side comparison of the Virgo and LIGO
pipeline results is summarized in Section VI. The benefits of a combined LIGO and Virgo
inspiral search are presented in Section VII; we find that there is an increase in two-way
coincident detection probability, and that there is also the means to gather information on
the sky location of the source. Concluding remarks and an outline of future goals are
presented in Section VIII.



II. REVIEW OF INITIAL COMPARISON PROJECT 

LIGO and Virgo recently initiated a comparison of their inspiral search pipelines [7]. In
this study each group tested their binary inspiral pipelines on simulated data. A similar
study was also conducted for the burst search pipelines. For the inspiral study, signals
were created from optimally oriented sources, directly above the interferometers, with noise
levels comparable to the target sensitivities for LIGO and Virgo. Both LIGO and Virgo
created signals in order to confirm that the other collaboration's pipeline was able to detect
them.

The simulated LIGO signals were created using two mass pairs, [1.4 $M_\odot$, 1.4 $M_\odot$] and
[1.0 $M_\odot$, 1.0 $M_\odot$], at various distances (20, 25, 30 and 35 Mpc), and then inserted into the
noise. The LIGO h(t) strain signal was created with a sampling rate of 16,384 Hz, and a
lower frequency cutoff of 40 Hz. A total of 26 signals were spread over 3 hours of data. The
synthesized Virgo signals were from a [1.4 $M_\odot$, 1.4 $M_\odot$] mass pair at a distance of 25 Mpc.
The h(t) strain signal had a sampling rate of 20,000 Hz, and a low frequency cutoff of 24
Hz; 9 signals were injected into 2.5 hours of data.

The results from the initial study confirmed the ability of each group to correctly detect and
characterize a binary inspiral signal. The LIGO and Virgo groups both analyzed the data
created by each group. The LIGO and Virgo pipelines were able to detect the same events,
and produced comparable parameter estimates for the chirp mass, $m_c = (m_1 m_2)^{3/5}/(m_1 + m_2)^{1/5}$,
the effective distance, and the arrival time. This initial study gave confidence to both
groups, and also encouraged us to engage in even more comprehensive and challenging tests.
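
As a quick numerical illustration of the chirp mass definition quoted above, the following minimal Python sketch evaluates it for the reference system of two 1.4 solar-mass stars. The function name `chirp_mass` is hypothetical and is not taken from either collaboration's software.

```python
def chirp_mass(m1: float, m2: float) -> float:
    """Chirp mass (m1*m2)^(3/5) / (m1+m2)^(1/5), in the same units as the inputs."""
    return (m1 * m2) ** 0.6 / (m1 + m2) ** 0.2


# Reference binary neutron star system: two 1.4 solar-mass components.
mc_ref = chirp_mass(1.4, 1.4)
print(f"chirp mass of a 1.4-1.4 binary: {mc_ref:.4f} solar masses")
```

For equal masses the expression reduces to $m \, 2^{-1/5}$, so the chirp mass is always somewhat smaller than either component mass.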



III. SIGNAL GENERATION 

The results of our previous study [7] were encouraging, but the examination of more
realistic signals was needed. We decided to create simulated signals that produce a realistic
detection scenario for LIGO and Virgo. When discussing the inspiral of binary neutron star
pairs, [1.4 $M_\odot$ - 1.4 $M_\odot$] systems define a convenient reference. For our definition of the
sensitivity of a detector we use the distance of a [1.4 $M_\odot$ - 1.4 $M_\odot$] binary pair that is in
the optimal direction (directly overhead) and orientation; the distance at which such a system
produces a signal to noise ratio (SNR) of 8 defines our inspiral sensitivity metric. In reality,
actual sources will impinge on the detectors from directions, orientations, and polarizations
that are sub-optimal. This produces a decrease in the signal amplitude, or equivalently a larger
effective distance. When one averages over all directions, orientations and polarizations, the
average effective distance to sources is 2.3 times greater than the actual distance. LIGO
and Virgo were designed such that at their target sensitivities their inspiral ranges would
extend up to 35 Mpc (for an optimally oriented 1.4 $M_\odot$ - 1.4 $M_\odot$ binary inspiral). Since
the Virgo cluster of galaxies is within this distance (at 16 Mpc), we decided to have this
LIGO-Virgo inspiral study respond to signals from the M87 galaxy, as it is the largest galaxy
in the Virgo cluster. In addition, other signals were created to simulate emission from
NGC 6744 at 10 Mpc. The random orientation of the sources, plus the rotation of the Earth,
created signals that produced a wide variety of responses from the detectors.
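
The text does not spell out how the effective distance depends on source geometry. The sketch below assumes the definition commonly used in inspiral searches, $D_{\rm eff} = D / \sqrt{F_+^2 (1+\cos^2\iota)^2/4 + F_\times^2 \cos^2\iota}$, where $F_+$ and $F_\times$ are the detector antenna responses and $\iota$ is the orbital inclination. The function name and the antenna-response values in the second example are illustrative assumptions, not values from this study.

```python
import math


def effective_distance(distance, f_plus, f_cross, inclination):
    """Effective distance under the commonly used definition (assumed here).

    distance     -- physical distance to the source (any length unit)
    f_plus/cross -- antenna responses of the detector, each in [-1, 1]
    inclination  -- orbital inclination angle in radians
    Raises ZeroDivisionError if the detector has no response at all.
    """
    cos_i = math.cos(inclination)
    plus = f_plus * (1.0 + cos_i ** 2) / 2.0
    cross = f_cross * cos_i
    return distance / math.sqrt(plus ** 2 + cross ** 2)


# Optimally located, face-on source at the distance of the Virgo cluster.
d_opt = effective_distance(16.0, 1.0, 0.0, 0.0)

# Same distance, but with reduced antenna response and an inclined orbit
# (illustrative values only).
d_sub = effective_distance(16.0, 0.5, 0.3, math.pi / 3)

print(f"optimal: {d_opt:.1f} Mpc, sub-optimal: {d_sub:.1f} Mpc")
```

Under this definition an optimally located and oriented source has an effective distance equal to its physical distance, and any other geometry gives a larger value.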

For the study presented in this paper the LIGO and Virgo groups each created 24 hours of
simulated data. The LIGO detectors modeled were the 4 km systems at Hanford, WA (H1)
and Livingston, LA (L1), while the Virgo detector was that at Cascina, Italy (V1). The noise
of the data matched that of the target sensitivities for the LIGO and Virgo interferometers.
Fig. 1 shows the noise spectral densities for the simulated data. Within the 24 hours of data
144 inspiral signals were injected from the two galaxies; the sources came from systems with
random orbital plane orientations. The masses of the binary system stars were within the
1 $M_\odot$ to 3 $M_\odot$ range.

In order to initiate our study of the LIGO and Virgo inspiral detection capacities we
ensured that the production of 2.0 post-Newtonian (PN) [10] binary inspiral signals for
1.4 $M_\odot$ - 1.4 $M_\odot$ mass pairs by both groups was identical. This exercise highlighted three
important areas that need to be closely monitored in studies dependent on simulated inspiral
signal generation. First, it is important to use exactly the same value of Newton's constant, G.
Second, the lengths of the inspiral chirps need to be monitored. Finally, the
termination frequency of the inspiral signal needs to be defined in the same way.
Since the calculated signals come from approximation methods in general relativity it is not
surprising that slight discrepancies can occur. At 2.0 PN one can write the frequency of
the waveform as a function of time, $f(t)$, or the time before coalescence as a function of the
frequency, $t(f)$. These two functions are inverses of each other only to 2.0 PN order. In the
LIGO code, the length of the chirp is determined by solving $f(t) = f_{\rm low}$ using a root finder.
However, in the Virgo code, the length of the chirp is found by evaluating $t(f_{\rm low})$. For
a 1.4-1.4 solar mass inspiral beginning at 30 Hz, the LIGO code generates a signal that is
54.6789 s long, while the inspiral generated by the Virgo software is 54.6799 s long, so the
calculated length of the chirp differs by 1 ms between the two methods. In the test we are
performing, this leads to an offset between the two chirps. It was possible to obtain agreement
between the two waveforms by simply sliding the Virgo waveform backwards by 1 ms. For the signal
termination there was a slight difference found between the LIGO and Virgo methods, which
was due to the two codes using somewhat different ending frequencies. LIGO uses the
test mass innermost stable circular orbit (ISCO). Virgo nominally uses the last stable orbit
(LSO, the occurrence of the minimum of the dynamical energy, see [10]) calculated at 2PN
order, or the point at which the phasing formula defined in [?] breaks down at 2.0 PN order
(which was the case for the Virgo generated signals in the present study). This results in an
additional cycle at the very end of the Virgo waveform, but we found that this contributes
negligibly to the signal to noise ratio for the binary neutron star coalescence, and to the
detection results. When one does calculations at a finite order in a perturbative expansion,
it is unavoidable to meet quantities that do not yield the same value when computed via
different methods, and shifting the Virgo waveform by 1 ms was a way to show that this
perturbative issue can be traded for a redefinition of the time.

FIG. 1: The noise spectrum (at design sensitivity) of the simulated data for LIGO (solid line) and
Virgo (dashed line). The distortion close to the Nyquist frequency in the Virgo spectrum is due to
the low pass filter applied before downsampling the Virgo data generated at 40 kHz down
to 20 kHz.

The signals generated by LIGO and Virgo in this present study both used the same
value of G, specifically $G = 6.67259 \times 10^{-11}\ \mathrm{m^3\,kg^{-1}\,s^{-2}}$. The LIGO and Virgo signals
were adjusted so that the coalescence times and phases were the same, as these are some
of the important parameters to be estimated by the detection pipelines. The sky locations
were determined by the galaxies. The angle parameters for the signals (phase at coalescence,
polarization, the cosine of the orbital plane inclination) were chosen randomly from uniform
distributions. The masses were selected from a small set (1.0, 1.4, 2.0, and 3.0 $M_\odot$). The
injections were spaced in time so that they would not significantly affect the power spectrum
estimation; the injection times were chosen randomly with an average of one injection every 600
s. In the end we were content with the overall similarity of the LIGO and Virgo signals, and
proceeded with the detection study.
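
To give a feel for why the value of G matters at the millisecond level, the following sketch uses the leading-order (Newtonian) chirp time, $\tau = \frac{5}{256} (\pi f_{\rm low})^{-8/3} (G m_c/c^3)^{-5/3}$, rather than the 2.0 PN expressions used by the LIGO and Virgo codes. It therefore gives roughly 53 to 54 s for a 1.4-1.4 solar mass system starting at 30 Hz, close to but not equal to the 2.0 PN durations quoted above. The solar mass value and the function name are assumptions made for this illustration.

```python
import math

G = 6.67259e-11      # m^3 kg^-1 s^-2, the value quoted in the text
C = 299_792_458.0    # speed of light in m/s
M_SUN = 1.989e30     # kg; assumed value, not given in the text


def newtonian_chirp_time(m1_sun, m2_sun, f_low, g=G):
    """Leading-order time to coalescence (s) from gravitational-wave frequency f_low (Hz)."""
    m1, m2 = m1_sun * M_SUN, m2_sun * M_SUN
    mc_kg = (m1 * m2) ** 0.6 / (m1 + m2) ** 0.2   # chirp mass in kg
    mc_sec = g * mc_kg / C ** 3                    # chirp mass expressed in seconds
    return (5.0 / 256.0) * (math.pi * f_low) ** (-8.0 / 3.0) * mc_sec ** (-5.0 / 3.0)


tau = newtonian_chirp_time(1.4, 1.4, 30.0)
# Same system, but with G changed by one part in 10^4.
tau_shifted = newtonian_chirp_time(1.4, 1.4, 30.0, g=G * (1.0 + 1.0e-4))

print(f"leading-order duration: {tau:.2f} s")
print(f"change for a 1e-4 fractional shift in G: {1e3 * (tau_shifted - tau):.1f} ms")
```

Because the duration scales as $G^{-5/3}$, a fractional difference of $10^{-4}$ in G changes the chirp length by several milliseconds, which is comparable to the ±10 ms window used later to match triggers to injections.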



IV. VIRGO AND LIGO INSPIRAL DETECTION ROUTINES 

Virgo has developed two search pipelines for binary neutron star inspiral signals, the
purpose of which is to experiment with different analysis solutions and cross-check their
outputs; the plan is to keep developing both methods, because it is anticipated that the two
may have different merits when applied to different portions of the parameter space, and/or
to different kinds of binary systems (black holes, inclusion of spin, etc.). One Virgo method is
a multi-band templated analysis (MBTA), whereby the templates are split for efficiency into
low and high frequency parts [12]. The templates are then combined
in a hierarchical way. Virgo also has a standard flat-search pipeline, called Merlino,
that is similar to the LIGO inspiral pipeline [?]. Both of these Virgo inspiral detection
pipelines, as well as the single LIGO pipeline, were applied to the data in this study. A
summary of the basic details of the Virgo and LIGO inspiral detection pipelines is given in
Table I; detailed descriptions are in the sub-sections below. The range of masses covered by
the templates was 1-3 $M_\odot$. The template banks were created with a minimal match criterion
of 95%, ensuring that no event in that mass space would be detected with a SNR loss greater
than 5%. The starting frequency $f_{\rm low}$ for the analysis of the data, in order to be consistent
with the SNR accumulation (defined by the sensitivity curves of both experiments), was set
to 40 Hz for LIGO data and to 30 Hz for Virgo data. Triggers were recorded when the
SNR exceeded a threshold of 6. With the trigger lists from each of the pipelines, an event
is labeled as true if the end time of the inspiral event matches the end time of the injected
inspiral event within ±10 ms.
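
The ±10 ms matching rule described above can be sketched as follows. This is a minimal illustration of the bookkeeping, not the code used by any of the three pipelines; the function name and the example times are hypothetical.

```python
from bisect import bisect_left


def detection_efficiency(trigger_times, injection_times, window=0.010):
    """Fraction of injections with at least one trigger end time within +/- window (s)."""
    triggers = sorted(trigger_times)
    found = 0
    for t_inj in injection_times:
        # Index of the first trigger at or after the start of the window.
        i = bisect_left(triggers, t_inj - window)
        if i < len(triggers) and triggers[i] <= t_inj + window:
            found += 1
    return found / len(injection_times)


# Hypothetical end times in seconds.
injections = [600.000, 1200.000, 1800.000, 2400.000]
triggers = [600.004, 1200.020, 1799.992, 3000.000]

eff = detection_efficiency(triggers, injections)
print(f"detection efficiency: {eff:.0%}")
```

In this example the first and third injections have a trigger within 10 ms, the second has a trigger 20 ms away, and the fourth has none, so two of the four injections count as detected.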





                             LIGO dataset     Virgo dataset
Mass range                   1-3 $M_\odot$    1-3 $M_\odot$
Grid minimal match           95%              95%
Starting frequency f_low     40 Hz            30 Hz
Longest template duration    ~45 seconds      ~96 seconds
SNR threshold                6                6

TABLE I: Common search parameters for the LSC and Virgo pipelines.



The mass parameter space layout for the LIGO grid is described in [14]. The Virgo
MBTA pipeline creates the grid according to a 2D contour reconstruction technique based
on the parameter space metric [15]. The Virgo Merlino template placement is explained in
[?]. Table II provides information about the way the production was done for the three
pipelines and each data set, which type of processor was used, and other resources needed.



Pipeline                       LIGO          MBTA          Merlino       LIGO          MBTA          Merlino
Data set                       LIGO          LIGO          LIGO          Virgo         Virgo         Virgo
Number of templates            ~2900         ~1900         ~2000         ~10900        ~7000         ~6800
Type of processor              1 GHz         2.2 GHz       2.2 GHz       2.66 GHz      2.2 GHz       2.2 GHz
                               Pentium II    Opteron       Opteron       Xeon          Opteron       Opteron
Total processing time          ~368 hours    ~55 hours     ~50 hours     ~704 hours    ~231 hours    250 hours
(processing time x processor   ~457 s GHz    ~229 s GHz    198 s GHz     ~618 s GHz    ~261 s GHz    291 s GHz
speed) / number of templates

TABLE II: Configuration and computing cost of each analysis.
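
The normalized cost in the last row of Table II can be reproduced from the other rows. The short sketch below does this arithmetic, converting hours to seconds; the variable and function names are hypothetical, and the input numbers are the approximate values listed in the table.

```python
# (pipeline, data set, number of templates, clock speed in GHz, processing time in hours)
runs = [
    ("LIGO",    "LIGO",   2900, 1.00, 368),
    ("MBTA",    "LIGO",   1900, 2.20,  55),
    ("Merlino", "LIGO",   2000, 2.20,  50),
    ("LIGO",    "Virgo", 10900, 2.66, 704),
    ("MBTA",    "Virgo",  7000, 2.20, 231),
    ("Merlino", "Virgo",  6800, 2.20, 250),
]


def cost_per_template(templates, clock_ghz, hours):
    """Processing time (s) times processor speed (GHz), divided by the number of templates."""
    return hours * 3600.0 * clock_ghz / templates


for pipeline, data_set, templates, clock_ghz, hours in runs:
    cost = cost_per_template(templates, clock_ghz, hours)
    print(f"{pipeline:8s} on {data_set:5s} data: {cost:4.0f} s GHz per template")
```

This metric only rescales by clock speed, so it does not account for architectural differences between the processor families.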



Table III summarizes the detection efficiency results of the three inspiral pipelines on
their application to the data sets used in this study. Table IV provides the information
on the accuracy of each of the three pipelines for parameter determination for the signals
detected in the V1 data. These results are explained in the sections below.

                    % detected    % detected    % detected    FAR for      FAR for
                    in L1 data    in H1 data    in V1 data    LIGO data    Virgo data
LIGO pipeline       64%           66%           62%           0.07 Hz      0.77 Hz
MBTA pipeline       62%           61%           56%           0.02 Hz      0.1 Hz
Merlino pipeline    55%           59%           55%           0.03 Hz      0.1 Hz

TABLE III: The signal detection results for the LIGO, MBTA and Merlino inspiral detection
pipelines applied to the L1, H1 and V1 data. The false alarm rate (FAR) for the pipelines applied
to the data is also listed.





                    chirp mass              chirp mass              end time      end time      effective distance    effective distance
                    difference              difference              difference    difference    fractional error      fractional error
                    mean ($M_\odot$)        RMS ($M_\odot$)         mean (ms)     RMS (ms)      mean                  RMS
MBTA pipeline       $2.46 \times 10^{-4}$   $9.13 \times 10^{-4}$   0.340         0.876         -0.0031               0.10
Merlino pipeline    $1.60 \times 10^{-4}$   $1.00 \times 10^{-3}$   0.083         1.03          0.01                  0.11
LIGO pipeline       $2.79 \times 10^{-4}$   $9.04 \times 10^{-4}$   0.966         1.29          -0.026                0.112

TABLE IV: Parameter determination accuracy results for the LIGO, MBTA and Merlino inspiral
detection pipelines applied to V1 data. The values in the table are derived from the distribution
of all events detected by the respective pipeline. The parameter difference is defined as the actual
parameter value subtracted from the pipeline's recovered value; results are given for the chirp
mass (in units of $M_\odot$) and end time (in units of ms). Also listed is the effective distance fractional
error, which is defined as (recovered effective distance - actual effective distance) / (actual effective
distance).

A. Results for Virgo multi-band analysis of inspiral signals 

The Virgo MBTA inspiral detection pipeline was designed to reduce the computational
cost of a binary inspiral search. A large number of templates are needed to sufficiently cover
the parameter space, especially for binary systems containing relatively small masses. The
required number of templates depends on the duration of the longest possible signal, and
this is affected by the relatively slow frequency evolution of the binary inspiral at lower
frequencies. The computational speed of a Fast Fourier Transform (FFT) depends on the
frequency span of the data set. The matched filtering technique uses FFTs as part of the
computation. The goal of the MBTA technique is to split the analysis into low and high
frequency parts; these results are subsequently combined in a hierarchical way. A description
of the method can be found in [12].

The MBTA pipeline was applied to data from H1, L1 and V1. Detected events with
SNR > 6 were recorded, and clustered both in time and over the template bank; triggers
with matching end times (within ±10 ms) were considered the same event, and the trigger
with the highest SNR was recorded; if this trigger was within ±10 ms of an injection
end time then it was counted as a detection. The template bank spanned the range
from 1 $M_\odot$ to 3 $M_\odot$ and had a minimal match of 0.95; a total of 7000 templates were used to
analyze the Virgo data set, and 1900 templates for the LIGO data. The Virgo MBTA code
was run with a splitting frequency between the low and high frequency bands chosen so as
to share the SNR approximately equally between the two bands; the band splitting
frequency was 130 Hz when applied to the LIGO data, and 95.3 Hz for the Virgo data. With
these settings the single instrument false alarm rate was 0.1 Hz for the Virgo data, and 0.02
Hz for the LIGO data. In general, the higher false alarm rate with the Virgo data (seen with
all the pipelines) is due to the larger number of templates used, which itself is due to the
lower frequency cut-off of the search. Based on the number of signals found in the data set,
the single interferometer detection efficiencies for the MBTA pipeline were 61% for signals in
the H1 data, 62% for signals in the L1 data, and 56% for signals in the V1 data.
The actual value of the efficiency depends entirely on the source population (and resultant
effective distance distribution) chosen. Because we have chosen a population for which the
efficiency is roughly 50%, we are particularly sensitive to small differences in the pipelines
and algorithms, which is the goal of this study. There were a total of 144 injections, so the
statistical error on the detection efficiencies is around 4% for each detection pipeline. The
injections that were not found had effective distances exceeding the range of the instrument,
typically greater than 50 Mpc.
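
The quoted statistical error of about 4% is consistent with treating each of the 144 injections as an independent found-or-missed trial, for which the binomial standard error is $\sqrt{\epsilon(1-\epsilon)/N}$. The sketch below assumes this binomial model (the text does not state how the error was computed) and evaluates it for the MBTA efficiencies.

```python
import math


def efficiency_error(efficiency, n_injections):
    """Binomial standard error on a detection efficiency (assumed error model)."""
    return math.sqrt(efficiency * (1.0 - efficiency) / n_injections)


N_INJECTIONS = 144
for detector, eff in [("H1", 0.61), ("L1", 0.62), ("V1", 0.56)]:
    err = efficiency_error(eff, N_INJECTIONS)
    print(f"{detector}: efficiency {eff:.0%} +/- {err:.1%}")
```

The error is largest when the efficiency is near 50%, yet that is also where the efficiency changes fastest with effective distance, which is why a population with roughly 50% efficiency is sensitive to small differences between pipelines.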

The ability to quantify the accuracy with which we can recover various injected parameters
was an important goal for this study of the inspiral detection pipelines. Using the Virgo
signal data, histograms were created for the parameter determination difference, with Fig. 2
for the chirp mass, and Fig. 3 for the end time. The difference is defined as (recovered -
injected). Fig. 4 displays the effective distance parameter determination in terms of
fractional error, while Fig. 5 displays the fractional error in the detected effective distance versus
the actual effective distance. The effective distance fractional error is defined as (recovered
effective distance - actual effective distance) / (actual effective distance). In Fig. 5 one can
see that for large injected distances, the recovered distance tends to be less than the injected
distance; this is because these events are near the threshold of the search. If the noise
acts to make the signal weaker (i.e. to increase the effective distance) the event will not
be detected. However, if the noise acts to make the signal stronger (i.e. to decrease the
effective distance) the pipeline will detect the injected signal above threshold; this effect was
observed in all the inspiral detection pipelines that we studied.

FIG. 2: Histograms of the chirp mass determination accuracy for the Virgo MBTA, Virgo Merlino,
and LIGO inspiral analysis pipelines. These three pipelines were applied to the Virgo data, and
displayed are the results from the signals recovered. The difference is defined as the actual chirp
mass subtracted from the recovered chirp mass.

FIG. 3: Histograms of the end time determination accuracy for the Virgo MBTA, Virgo Merlino,
and LIGO inspiral analysis pipelines. These three pipelines were applied to the Virgo data, and
displayed are the results from the signals recovered. The difference is defined as the actual end
time subtracted from the recovered end time.

B. Results for Virgo flat band search pipeline of inspiral signals 

One of the main goals for Virgo is the realization of a reliable real time observation strategy
in order to use the interferometer as a gravitational wave observatory. With this aim,
the computational strategy was carefully designed for a binary inspiral search by addressing
the computational size of the problem, available computational resources, communication
requirements, data handling, and the constraints due to real-time analysis conditions. The
Distributed Signal Analyzer (DiSA) [?], named Merlino, is a particular binary inspiral
search solution implemented in Virgo, based on a parallel-distributed applications environment.
This framework is composed of several processes communicating via the Message Passing
Interface (MPI), distributing and controlling user algorithms and the data. The algorithms
can be dynamically changed and inserted in the Merlino logic data flow using a plug-in
strategy. Specifically, the coalescing binaries plug-ins have been used to analyze the data
examined in the study presented in this article; the overlap-add data handling method and
the storage in memory of the template bank are implemented in order to further speed up
the analysis [13].

FIG. 4: Histogram of the effective distance fractional error for the Virgo MBTA, Virgo Merlino,
and LIGO inspiral analysis pipelines. These three pipelines were applied to the Virgo data, and
displayed are the results from the signals recovered. The effective distance fractional error is defined
as (recovered effective distance - actual effective distance) / (actual effective distance).



FIG. 5: The effective distance determination accuracy as a function of actual effective distance, for
the Virgo MBTA, Virgo Merlino, and LIGO inspiral analysis pipelines. These three pipelines were
applied to the Virgo data, and displayed are the results from the signals recovered. The effective
distance fractional error is defined as (recovered effective distance - actual effective distance) / (actual
effective distance).

A total of 6800 templates were used to cover the 1.0 $M_\odot$ to 3.0 $M_\odot$ range in the Virgo
data, while 2000 templates were used for the LIGO data; the template bank had a minimal
match of 0.95. A $\chi^2$ threshold [?] with 15 bands was used, and triggers were required to
have a SNR > 6. All events within ±10 ms were clustered, and the event with the largest
SNR that satisfied the $\chi^2$ cut was selected as the trigger; if this trigger was within
±10 ms of an injection event end time then it was counted as a detection. This flat search
code found 55% of the binary inspiral signals injected into the V1 data, 59% of the signals
in the H1 data, and 55% of the signals in the L1 data. The false alarm trigger rate was 0.1
Hz for the Virgo data, and 0.03 Hz for the LIGO data.
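
The time clustering described above can be illustrated with a short sketch. It assumes one plausible reading of the rule, namely that a trigger joins the current cluster if its end time is within 10 ms of the previous trigger in that cluster, and that triggers failing the $\chi^2$ cut have already been removed. The function name and example values are hypothetical, and this is not the Merlino implementation.

```python
def cluster_triggers(triggers, window=0.010):
    """Group triggers in time and keep the loudest trigger of each group.

    triggers -- iterable of (end_time_s, snr) pairs
    window   -- maximum gap (s) between consecutive triggers in one cluster
    """
    clusters = []
    for t, snr in sorted(triggers):
        if clusters and t - clusters[-1]["last"] <= window:
            current = clusters[-1]
            current["last"] = t
            if snr > current["best"][1]:
                current["best"] = (t, snr)
        else:
            clusters.append({"last": t, "best": (t, snr)})
    return [c["best"] for c in clusters]


# Hypothetical triggers: three close together, one isolated.
raw = [(100.000, 6.5), (100.004, 9.1), (100.009, 7.2), (250.000, 6.1)]
clustered = cluster_triggers(raw)
print(clustered)
```

With these inputs the first three triggers form one cluster represented by the SNR 9.1 trigger, and the isolated trigger forms a second cluster on its own.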

The ability of the Virgo Merlino pipeline to resolve signal parameters was comparable
to that of the other pipelines, and example results are displayed here for the analysis of the
Virgo data set. Fig. 2 displays the ability of the Merlino code to accurately determine the
chirp mass. The accuracy of the end-time parameter is presented in Fig. 3, while Fig. 4
displays the effective distance parameter estimation. Fig. 5 also displays the fractional error
in the detected effective distance versus the actual effective distance.

C. Results for LIGO inspiral detection routine

The LIGO inspiral detection pipeline has been used to search for signals and set upper
limits with data from LIGO's first two scientific data runs [?, ?]. These publications also
present detailed descriptions of the LIGO inspiral pipeline. This same LIGO inspiral pipeline
was applied to the 24 hours of data for this present study. The simulated signals from H1,
L1 and V1 were all examined with the LIGO code.

For the initial single detector test a threshold of SNR > 6 was used, with no $\chi^2$ threshold
or mass consistency check. The data was high-pass filtered (including the injections) for each
interferometer individually. The template bank spanned the range from 1 $M_\odot$ to 3 $M_\odot$ and
had a minimal match of 0.95; a total of 10900 templates were used to analyze the Virgo
data set, and 2900 templates for the LIGO data. When calculating the detection efficiency
we required that a candidate trigger occur within ±10 ms of the injected signal's end time.
There was no clustering over the template bank. For the H1 data 66% of the inspiral signals
were detected, while the pipeline found 64% of the L1 signals. The signal
detection efficiency was 62% for the V1 data. For the LIGO pipeline the false alarm trigger
rate was 0.07 Hz when analyzing the LIGO data and 0.77 Hz for the Virgo data set.

The recovered parameter values correspond to the template producing the largest SNR
trigger within 10 ms of the actual end time. Using the Virgo signal data, we created
histograms of the parameter determination difference (Fig. 2 for the chirp mass, Fig. 3 for
the end time, and Fig. 4 for the effective distance), as well as a plot, Fig. 5, that displays
the fractional error in the detected effective distance versus the actual effective distance.

V. MCMC PARAMETER ESTIMATION 

Parameter estimation, and the generation of a posterior probability density function
(PDF) for each parameter, was also done utilizing a Markov chain Monte Carlo (MCMC)
routine. These MCMC methods are part of the LIGO binary inspiral data analysis effort.
The basic operation of the inspiral MCMC code is described in [17], which also contains a
description of MCMC techniques. The purpose of this code is to take triggers generated
from the LIGO inspiral pipeline, and then examine the section of the data around the event.
This MCMC code was applied to those sections of data in this study where coincident events
were found. The MCMC is not a search pipeline, as the program is too computationally
taxing to be applied to all of the data. The MCMC searched for events that had a binary
coalescence end time within a ±50 ms window of coincident triggers from the LIGO inspiral
search pipeline.

The MCMC code used in the present study was written in C, and looked for inspiral 
signals based on 2.0 PN waveforms in the frequency domain. For the present study the prior 
for the masses of the compact objects was uniform over the range 0.9 $M_\odot$ to 3.1 $M_\odot$. 
For this problem there are five parameters to estimate: the binary masses, $m_1$ and $m_2$, the 
effective distance $d_L$, the phase at coalescence $\phi_c$ and the time at coalescence $t_c$. The 
program reparameterizes the masses in terms of the chirp mass 
$m_c = (m_1 m_2)^{3/5}/(m_1 + m_2)^{1/5}$ and the mass ratio parameter 
$\eta = m_1 m_2/(m_1 + m_2)^2$. This inspiral MCMC code begins with a method called 
importance resampling [19]; at the start the code first generates a large 
sample of parameter space points from a distribution covering the whole prior, and then 
draws the actual sample out of these with correspondingly assigned weights depending on 
the posterior density. The Markov chains are started in regions of parameter space that 
are likely to be close to the true parameter values. Simulated annealing [20] was used to 
optimize the initial burn-in of the Markov chains. During the burn-in period, the effect of the 
noise in the likelihood function is arbitrarily increased (an effective temperature increase); 
this simulated annealing technique was introduced in [21] and allows scanning of the whole 
parameter space by permitting larger steps. The candidate generating function [17] for the 
parameters is designed so that correlations between chains are measured as the program 
progresses, and correlation values are fed back to the generating function. 
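
As a small illustration of the mass reparameterization described above, the following sketch converts component masses to the chirp mass and mass ratio parameter, and back. It is not taken from the MCMC code itself (which was written in C); the function names are hypothetical and chosen only for this example.

```python
import math


def chirp_mass_and_eta(m1, m2):
    """Return (m_c, eta) for component masses m1 and m2.

    m_c = (m1*m2)^(3/5) / (m1+m2)^(1/5), eta = m1*m2 / (m1+m2)^2.
    The chirp mass comes out in the same units as the inputs.
    """
    total = m1 + m2
    eta = m1 * m2 / total ** 2
    m_c = (m1 * m2) ** 0.6 / total ** 0.2
    return m_c, eta


def component_masses(m_c, eta):
    """Invert the map above; returns (m1, m2) with m1 <= m2.

    Uses m_c = eta^(3/5) * (m1 + m2), which follows from the two
    definitions, and requires 0 < eta <= 0.25.
    """
    total = m_c * eta ** -0.6
    # m1 and m2 are the roots of x^2 - total*x + eta*total^2 = 0.
    root = math.sqrt(max(0.0, 1.0 - 4.0 * eta))
    return 0.5 * total * (1.0 - root), 0.5 * total * (1.0 + root)


# Example masses (in solar masses) from the simulated event discussed in the text.
m_c, eta = chirp_mass_and_eta(1.4, 3.0)
print(round(m_c, 3), round(eta, 3))  # 1.759 0.217
```

For the 1.4 and 3.0 solar mass pair this reproduces the chirp mass and mass ratio values quoted for the example event.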

The MCMC parameter estimation code was applied to all of the LIGO-Virgo data from 
this study, and here we show examples from the analysis of the LIGO signals. Fig. 6 shows an 
example of MCMC generated estimates for the posterior probability distribution functions 
for a simulated event in the H1 data detected with the LIGO pipeline. This signal had 
true parameter values of $m_1 = 1.4\,M_\odot$, $m_2 = 3.0\,M_\odot$ ($m_c = 1.759\,M_\odot$ and 
$\eta = 0.217$), $d_L = 41.47$ Mpc and $t_c = 49.9815$ s. From the generated posterior PDFs, the 
estimate for each parameter is taken to be the mean of the distribution, and the error in 
the estimate corresponds to a width of the distribution. We give the 5 to 
95 percentile range of the posterior distribution, which gives a 90% posterior credibility 
interval. For this example the parameters' mean values and 90% credibility ranges are 
$m_c = 1.7597$ (1.7580 - 1.7623) $M_\odot$, $\eta = 0.2215$ (0.2126 - 0.2407), $d_L = 46.669$ 
(36.936 - 60.023) Mpc, and $t_c = 49.9823$ (49.9815 - 49.9837) s. 

FIG. 6: MCMC produced posterior PDFs for chirp mass $m_c$, mass ratio $\eta$, effective distance 
$d_L$ and the coalescence time $t_c$ for one event. In this example the binary inspiral signal is 
embedded in the H1 data. The actual parameters used to produce this signal, displayed by the 
dashed vertical lines, are $m_1 = 1.4\,M_\odot$, $m_2 = 3.0\,M_\odot$ ($m_c = 1.759\,M_\odot$ and 
$\eta = 0.217$), $d_L = 41.47$ Mpc and $t_c = 49.9815$ s. 
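
The summary statistics described above (posterior mean plus the 5 to 95 percentile range) can be computed from a set of MCMC samples along the following lines. This is a minimal sketch, not the code used in the study: the function names are hypothetical, and the linear-interpolation percentile convention is an assumption, since the text does not specify one.

```python
def percentile(sorted_samples, q):
    """Linear-interpolation percentile (q in [0, 100]) of pre-sorted data."""
    if not sorted_samples:
        raise ValueError("no samples")
    pos = (len(sorted_samples) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(sorted_samples) - 1)
    frac = pos - lo
    return sorted_samples[lo] * (1.0 - frac) + sorted_samples[hi] * frac


def summarize_posterior(samples):
    """Return (mean, lower, upper) for one parameter's posterior samples.

    lower and upper are the 5th and 95th percentiles, so that
    [lower, upper] is a 90% credibility interval.
    """
    ordered = sorted(samples)
    mean = sum(ordered) / len(ordered)
    return mean, percentile(ordered, 5.0), percentile(ordered, 95.0)


# Toy stand-in for a Markov chain: evenly spaced values 0, 1, ..., 100.
mean, lower, upper = summarize_posterior([float(i) for i in range(101)])
print(mean, lower, upper)
```

In practice the samples would be the post burn-in chain values for a single parameter such as the chirp mass.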

As an example of the parameter estimation accuracy, Fig. 7 shows the difference between 
the MCMC estimated parameter values and the real values, as a function of effective distance. 
The errors represent the 90% credibility ranges. These events were from the simulated 
inspiral signals found by the LIGO pipeline in both the L1 and H1 data sets. All mass 
combinations are represented in this plot. 

FIG. 7: The parameter estimation accuracy for the MCMC chirp mass $m_c$, mass ratio $\eta$, 
effective distance $d_L$ and the coalescence time $t_c$ are shown. Displayed are the differences 
between the MCMC parameter estimates and the actual values, as a function of effective distance. 
These events were from the H1 and L1 data. The error bars correspond to the 90% credibility ranges. 

VI. LIGO AND VIRGO INSPIRAL DETECTION PIPELINE COMPARISON 

The Merlino, MBTA and LIGO binary inspiral pipelines all operated in a comparable 
way, and produced good single interferometer detection statistics. Triggers were recorded, 
and for those with the largest SNR the parameter values of the templates were noted. The 
ability to resolve the parameters was equally good (within statistical errors and other 
uncertainties) for all of the pipelines. A major conclusion from our study is that the three 
inspiral search pipelines performed equally well, and there is mutual confidence between the 
groups in the others' inspiral search abilities. The comparisons presented in this section 
were all conducted using the Virgo data. 

In order to display the output of the pipelines, we present here a direct comparison of the 
Merlino, MBTA and LIGO results from the V1 data. Fig. 8 (left) shows a histogram of the 
ratio of the MBTA SNR to the LIGO SNR for the events detected. Similarly, Fig. 8 (right) 
shows a histogram of the ratio of the Merlino SNR to the LIGO SNR for the events detected. 
There were some slight differences in the results between the pipelines. For example, the 
SNR of detected events was about 6% larger from the LIGO pipeline than from the 
MBTA pipeline, and about 8% larger than from the Merlino pipeline. 

The difference in the SNRs is predominantly caused by slightly differing methods for 
calculating the noise power spectral density (PSD). Specifically, the LIGO pipeline 
calculates its PSD via the median power spectrum (the median is calculated frequency-bin by 
frequency-bin) of 15 overlapping segments, where each segment was 256 s in length. The 
median was chosen for the LIGO pipeline PSD generation so that it would not be overly 
biased by a single loud glitch (as could happen when using the mean), at the cost of a larger 
bias in the case of Gaussian noise. The PSD calculated by the Virgo pipeline is computed 
from a larger number of averages: it is the mean over 1800 s of data divided into 
non-overlapping sections, each 16.38 s long. We conducted a direct comparison of the PSDs 
generated by the LIGO and Virgo pipelines. The estimate of SNR$^2$ scales with frequency like 
$f^{-7/3}/S(f)$, where $S(f)$ is the noise PSD. When we sum $f^{-7/3}/S(f)$ over the frequencies 
from 40 to 2048 Hz the LIGO result exceeds that of Virgo by 10%; since the SNR scales as the 
square root of this sum, this gives an overestimation in the LIGO SNR 
of 5%. Numerical experiments have subsequently verified that increasing the number of 
segments used in calculating the noise PSD will reduce the bias. It should be noted that 
the bias from the noise PSD strictly cancels out in the estimation of the effective distance. 
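
The arithmetic behind the 10% and 5% figures, and the cancellation in the effective distance, can be illustrated with a short sketch. The toy PSD values below are invented for illustration only. The cancellation check assumes the common matched-filter convention in which the SNR is |z|/σ and the effective distance is proportional to σ²/|z|, with both z and σ² inversely proportional to the PSD; the text does not spell out the convention the pipelines use, and the function names are hypothetical.

```python
import math


def snr_squared_weight(freqs_hz, psd):
    """Sum f^(-7/3) / S(f) over the band; proportional to the expected SNR^2."""
    return sum(f ** (-7.0 / 3.0) / s for f, s in zip(freqs_hz, psd))


def rescaled_statistics(z, sigma_sq, scale):
    """(SNR, effective distance) after dividing the PSD by `scale`.

    Assumed convention: SNR = |z| / sigma, D_eff ~ sigma^2 / |z|,
    with z and sigma^2 both proportional to 1 / S(f).
    """
    z_new = z * scale
    sigma_sq_new = sigma_sq * scale
    return abs(z_new) / math.sqrt(sigma_sq_new), sigma_sq_new / abs(z_new)


# Toy example: two flat PSD estimates differing by a uniform factor, 40-2048 Hz.
freqs = [40.0 + 0.5 * k for k in range(4017)]
psd_virgo_style = [1.0] * len(freqs)          # invented reference PSD
psd_ligo_style = [1.0 / 1.10] * len(freqs)    # invented PSD that is lower overall

ratio_sq = (snr_squared_weight(freqs, psd_ligo_style)
            / snr_squared_weight(freqs, psd_virgo_style))
print(round(ratio_sq, 3), round(math.sqrt(ratio_sq), 3))  # 1.1 1.049

snr_a, dist_a = rescaled_statistics(10.0, 4.0, 1.0)
snr_b, dist_b = rescaled_statistics(10.0, 4.0, 1.10)
print(round(snr_b / snr_a, 3), round(dist_b / dist_a, 3))  # 1.049 1.0
```

A 10% excess in the summed weight therefore corresponds to roughly a 5% excess in SNR, while under the assumed convention a uniform PSD rescaling leaves the effective distance unchanged.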

We were also concerned that the frequency domain stationary phase approximation used 
by LIGO could affect the SNR estimation. Specifically, the signals for this study were 
generated to 2.0 PN order in the time domain. For both Virgo pipelines the frequency 
domain detection templates are the Fourier transforms of the time domain 2.0 PN signals. 
On the other hand, the LIGO pipeline uses 2.0 PN frequency domain templates generated 
via a stationary phase approximation. We studied the overlap between Fourier transforms of 
2.0 PN time domain signals and the stationary phase frequency domain signals. Our results 
indicate that the SNR for the LIGO pipeline would be reduced by at most 1% (and only for 
signals from the highest mass pairs). In addition we verified the effectualness and 
faithfulness of the signals used in this study, similar to what was done in [22]. Specifically 
for the mass pairs used in this study, the faithfulness varies between 96% and 98%. Lower 
values correspond to the larger mass ratios. The effectualness is always above 99%, except 
for the 3 $M_\odot$ - 3 $M_\odot$ mass pair, for which the effectualness is 98.8%. The accuracy 
in the estimation of the chirp mass using the stationary phase templates ranges from a few 
parts in $10^{-5}$ to a few parts in $10^{-4}$. 

We also examined how template placement in the $m_1$ versus $m_2$ plane would affect 
the recovered SNR, and we found that for our studies this had a small effect on the SNR 
difference. The density of the template grid is higher for the LIGO pipeline than for the Virgo 
pipelines. Specifics of the template placement for the inspiral pipelines used in our study are 
presented in [7]. When examining the SNR ratio from the MBTA and LIGO pipelines we 
found the excess value from the LIGO pipeline to be consistently present for the cases when 
the two masses were equal, or when they had a relatively large difference. The increased 
density of the LIGO grid would give an excess in SNR, but for the study presented here it 
appears that the effect would not produce an SNR difference exceeding 1%. 

From these studies we believe that we understand how differences arise in the 
SNR of the inspiral triggers from the LIGO and Virgo pipelines. The slightly higher detection 
efficiency produced by the LIGO pipeline is a direct consequence of its higher 
recovered SNR, which in turn is due to the difference in the methods used to estimate 
the noise PSD. All three pipelines in this study used an SNR > 6 threshold. Those events 
near this cutoff would have a slight preference for being seen by the LIGO pipeline. When 
one accounts for the SNR artifact the detection efficiencies of all of the pipelines are seen to 
be the same. 

Another small difference between the LIGO and Virgo pipelines concerns the estimate of the 
end time. The LIGO pipeline typically estimates an end time that is 1 ms off from the actual 
value. This can be seen in the output of the LIGO pipeline in Fig. 3 and the MCMC 
generated parameter estimates in Fig. 7. The 1 ms offset in the LIGO end time estimation 
is also apparent in Fig. 9 (left), where the end time accuracy for MBTA and LIGO events 
detected by both pipelines in the V1 data is presented. Similarly, a comparison of the LIGO 
and Merlino estimates of the end time also displays an offset of about 1 ms for the LIGO 
results; see Fig. 9 (right). The small difference in the nature of the detection templates 
(frequency domain stationary phase for LIGO, and Fourier transforms of the time domain 
templates for Virgo) is responsible for this small time shift. 

[Figure 8, two histogram panels. Left: SNR ratio (MBTA/LIGO pipeline), mean = 0.942, 
RMS = 0.025. Right: SNR ratio (Merlino/LIGO pipeline), mean = 0.92, RMS = 0.03. 
Horizontal axes run from 0.85 to 1.05.] 

FIG. 8: On the left is a histogram of the ratios of the SNRs for MBTA and LIGO events detected 
by both pipelines in the V1 data. The mean and RMS values for the distribution are given in the 
figure, showing a 6% excess in the size of the LIGO SNR. On the right is a histogram of the ratios 
of the SNRs for Merlino and LIGO events detected by both pipelines in the V1 data; there is an 8% 
excess in the size of the LIGO SNR. 

Fig. 10 (left) displays a comparison of the accuracy of the determination of the chirp 
mass for events detected by both the MBTA and LIGO pipelines from within the V1 data 
set, while a similar chirp mass comparison from the LIGO and Merlino pipelines is displayed 
in Fig. 10 (right). In Fig. 11 (left) we see the recovered effective distance divided by the 
actual injected effective distance for the signals detected by MBTA and LIGO in the V1 
data, while Fig. 11 (right) shows a similar effective distance comparison between the Merlino 
and LIGO pipelines. The chirp mass and effective distance estimates by LIGO, Merlino and 
MBTA were essentially the same. 

FIG. 9: On the left is a scatter plot of the end time accuracy for MBTA and LIGO events detected 
by both pipelines in the V1 data. The accuracy is defined as the actual end time subtracted from 
the recovered end time. The mean and RMS values are given in the figure, and show that the 
LIGO inspiral pipeline tends to overestimate the end time by about 1 ms. On the right is a scatter 
plot of the end time accuracy for Merlino and LIGO events detected by both pipelines in the V1 
data; again, the LIGO inspiral pipeline tends to overestimate the end time by about 1 ms. 

VII. BENEFITS FROM A COMBINED LIGO- VIRGO INSPIRAL SEARCH 

This close examination of the LIGO and Virgo binary inspiral search pipelines has also 
provided additional information on the benefits one can get from conducting a joint search 
for signals. In the effort to detect binary inspiral gravitational waves the inclusion of Virgo 
data significantly increases the probability of observing a coincident signal in at least two 
interferometers. The results presented above show that each pipeline is able to effectively 
detect events with sufficiently low effective distance, and recover the chirp mass, coalescence 
time, and effective distance. Since the effective distance value is influenced by detector 
orientation and source location, it is not a parameter that can be used as a test for coincident 
detection. However, the chirp mass and coalescence time can be used to require consistency. 
We set coincident window sizes of 0.2 $M_\odot$ for the chirp mass and ±8 ms about the light 
travel time between interferometers for the coalescence time (for reference, the light travel 
times between the detectors are 27.2 ms for Hanford-Virgo, 26.4 ms for Livingston-Virgo, and 
10 ms for Hanford-Livingston), and an SNR > 6 threshold. With these settings we found 
no triple coincident false events, and only one double coincident false alarm. 

FIG. 10: On the left is a scatter plot of the chirp mass accuracy for MBTA and LIGO events 
detected by both pipelines in the V1 data. On the right is a scatter plot of the chirp mass accuracy 
for Merlino and LIGO events detected by both pipelines in the V1 data. The accuracy is defined 
as the actual chirp mass subtracted from the recovered chirp mass. The mean and RMS values are 
given in the figures, and show that the Virgo MBTA, Virgo Merlino, and LIGO inspiral pipelines 
all estimated the chirp mass accurately. 
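
The coincidence requirements described in this section can be sketched as a simple pairwise test. This is an illustrative reading rather than the pipelines' implementation: it assumes the time window means that the end times may differ by at most the light travel time plus 8 ms, and that the chirp mass window is a maximum absolute difference of 0.2 solar masses. The `Trigger` class and function name are hypothetical.

```python
from dataclasses import dataclass

# Light travel times between sites in seconds, as quoted in the text.
LIGHT_TRAVEL_S = {
    frozenset(("H1", "V1")): 0.0272,
    frozenset(("L1", "V1")): 0.0264,
    frozenset(("H1", "L1")): 0.0100,
}
TIME_WINDOW_S = 0.008       # +/- 8 ms about the light travel time (assumed reading)
CHIRP_MASS_WINDOW = 0.2     # solar masses (assumed to be a maximum difference)
SNR_THRESHOLD = 6.0


@dataclass(frozen=True)
class Trigger:
    ifo: str            # "H1", "L1" or "V1"
    end_time: float     # coalescence time in seconds
    chirp_mass: float   # solar masses
    snr: float


def is_coincident(a, b):
    """True if two single-detector triggers pass the pairwise coincidence test."""
    if a.ifo == b.ifo:
        return False
    if a.snr <= SNR_THRESHOLD or b.snr <= SNR_THRESHOLD:
        return False
    max_dt = LIGHT_TRAVEL_S[frozenset((a.ifo, b.ifo))] + TIME_WINDOW_S
    return (abs(a.end_time - b.end_time) <= max_dt
            and abs(a.chirp_mass - b.chirp_mass) <= CHIRP_MASS_WINDOW)


h1 = Trigger("H1", 49.9815, 1.759, 9.0)
v1 = Trigger("V1", 50.0100, 1.760, 7.5)
l1 = Trigger("L1", 50.0065, 1.758, 8.0)
print(is_coincident(h1, v1), is_coincident(h1, l1))  # True False
```

In this toy example the H1 and L1 end times differ by 25 ms, which exceeds the 18 ms allowed for that pair, so only the H1-V1 pair is coincident. A triple coincidence would require all three pairs to pass.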

The double coincidence detection ability for the injected signals increased with the 
inclusion of V1 over just the H1-L1 coincidence alone. This is summarized in Table V, where the 
triple coincidence results and various two detector coincidence results are presented. The 
efficiency for detection of injections from NGC 6744 is larger since it is closer than M87. 





                       HLV     HL      HV      LV      HL ∪ HV ∪ LV 
NGC 6744 efficiency    48%     65%     54%     49%     72% 
M87 efficiency         24%     42%     32%     30%     56% 

TABLE V: The efficiency of detecting inspiral injections from NGC 6744 (at a distance of 10 Mpc) 
and M87 in the Virgo cluster (at a distance of 16 Mpc) using different combinations of the LIGO 
and Virgo detectors and an SNR threshold of 6 in all detectors. These results are from the MBTA 
pipeline. 
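
The entries in the efficiency table above can be cross-checked with inclusion-exclusion. The sketch below assumes that each two-detector column counts every injection seen in that pair, including those also seen in the third detector. Under that assumption the overlap of any two pair columns is exactly the triple coincidence set, so the "any two of three" efficiency is the sum of the pair efficiencies minus twice the triple efficiency. The function name is hypothetical.

```python
def any_two_of_three(hl, hv, lv, hlv):
    """Efficiency (in %) for a coincidence in at least two of three detectors.

    Inclusion-exclusion over the three pair sets: every pairwise overlap and
    the triple overlap all equal the HLV set, giving hl + hv + lv - 2 * hlv.
    """
    return hl + hv + lv - 2 * hlv


# Efficiencies in percent, copied from the table.
table = {
    "NGC 6744": {"hlv": 48, "hl": 65, "hv": 54, "lv": 49, "union": 72},
    "M87": {"hlv": 24, "hl": 42, "hv": 32, "lv": 30, "union": 56},
}

for galaxy, row in table.items():
    predicted = any_two_of_three(row["hl"], row["hv"], row["lv"], row["hlv"])
    print(galaxy, predicted, row["union"])
```

Both rows of the table satisfy this relation, which supports reading the last column as the union of the three two-detector coincidences.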

The coincidence results show the benefits of performing a search including all three 
detectors. The highest efficiency is obtained by requiring a signal to be observed in any two 
of the three detectors. For the closer NGC 6744 galaxy, the main advantage of adding the 
Virgo detector to a LIGO only search is the good triple coincident efficiency. Not only is 
the triple coincident false alarm rate very low, but also with a trigger in three detectors we 
can reconstruct the sky location of the source. 

FIG. 11: On the left is a scatter plot of the effective distance determination ratio for MBTA and 
LIGO events detected by both pipelines in the V1 data; on the right is the same but for Merlino 
and LIGO events. The parameter determination ratio is defined as the detected effective distance 
divided by the actual injected effective distance. 

For signals from both M87 and NGC 6744, the two-detector LIGO efficiency is greater 
than either the H1-V1 or L1-V1 efficiency. This is expected due to the similar orientations of 
the two LIGO detectors. However, by including Virgo and requiring a coincident trigger in 
two of the three detectors, we do obtain a 25% relative increase in efficiency. The M87 galaxy 
is in the Virgo cluster, which contains a significant fraction of potential binary neutron star 
inspiral sources for the initial interferometric detectors. A 25% increase in efficiency to these 
sources significantly increases the chance of making a detection. 

The reason for the increase in two-detector efficiency can be seen in Fig. 12. Displayed 
are the detected and missed signals for the LIGO pipeline on the H1 data, the Virgo Merlino 
pipeline on the L1 data, and the Virgo MBTA pipeline on the V1 data. One can see in these 
plots that signals go undetected, in all three pipelines, because of large effective 
distance values. It can be seen that over the course of 24 sidereal hours there is a variation 
in the ability to detect signals from the two source galaxies due to changes in interferometer 
alignment as the Earth rotates. Virgo's inclusion provides detections at times when the 
LIGO interferometers' alignments may be sub-optimal. 

FIG. 12: The detected (o) and missed (x) events for the LIGO pipeline on the H1 data, the Virgo 
Merlino pipeline on the L1 data, and the Virgo MBTA pipeline on the V1 data. Note too that 
the dependence of detection efficiency on effective distance can be seen in these plots; events 
with an effective distance in excess of 50 Mpc are difficult to detect. 

A. Source Directional Information 

The use of three widely spaced interferometers provides the opportunity for identifying 
the location on the sky of a binary inspiral event. This is another positive outcome of a 
combined LIGO and Virgo search. Fig. 13 shows the recovered sky position for events that 
were successfully detected in H1, L1 and V1. The MBTA pipeline was used to identify the 
triple coincidences using clustered triggers; for this demonstration we selected the highest 
SNR trigger within ±10 ms of injection, in each interferometer. 

The determination of the sky position is affected by the estimates for the other 
parameters, as the reconstructed values of the parameters are not independent. For example, a 
higher mass binary inspiral will traverse the sensitive band of the detectors more rapidly 
than one of lower mass. The reconstructed coalescence time and masses of the system will be 
correlated. These correlations make it difficult to determine the coalescence time, and then 
sky position, with good accuracy. It was possible, however, to improve on the sky position 
determination. Again using the MBTA pipeline, the search for a signal in H1 and L1 data 
was restricted such that the identified clusters for individual triggers were only those issued 
by the same template as the one leading the identified cluster in the V1 data (choosing the 
highest SNR if more than one is found). Specifically, the analysis of the H1 and L1 data 
was redone with the same template grid as was used to analyze the V1 data. However, if 
no trigger with the same template as in V1 can be found in H1 or L1, then L1 and V1 are 
examined for triggers with the same masses as in H1; if that is also unsuccessful, the same is 
done with L1 as the reference. By conducting the search accordingly the ability to estimate the 
mass parameters (and hence the end time and sky position) improves, and we end up with 39 
injections (out of 49 triple coincident detections) found with the same template in $m_1$-$m_2$ 
space. The order in which each interferometer is used in turn as a reference is somewhat 
arbitrary. However, for the present study the reason for starting with Virgo is the 
better mass resolution obtained with the V1 data (due to the better low frequency sensitivity). 
Fig. 14 shows the sky position accuracy obtained after enforcing this mass correction 
technique. The position accuracy distribution (recovered position minus true position) for the 
16 events from galaxy M87 had a mean of 4.1° and an RMS of 3.5°, while for the 23 events 
from galaxy NGC 6744 the position accuracy distribution had a mean of 2.3° and an RMS 
of 1.2°. 

FIG. 13: The recovered (dots) and injected (crosses) sky locations of the inspiral injections 
seen in all three detectors. For reference, the galaxy NGC 6744 is located at α = 19 hr 9.8 m, 
δ = -64° and M87 is located at α = 12 hr 30.8 m, δ = 12°. 

FIG. 14: The recovered and injected sky locations of the subset of inspiral injections which 
are seen in all three detectors using exactly the same template in $m_1$-$m_2$ space. The 39 
detected events in this figure were the ones found to have the same mass parameters for 
the triggers, as described in the text. 
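
One way to quantify a sky position error of the kind quoted above is the great-circle angle between the recovered and injected directions. The sketch below assumes that this angular separation is what the position accuracy measures, which the text does not state explicitly. The function names and the example recovered position are hypothetical.

```python
import math


def hours_to_degrees(hours, minutes=0.0):
    """Convert a right ascension given in hours and minutes to degrees."""
    return 15.0 * (hours + minutes / 60.0)


def angular_separation_deg(ra1_deg, dec1_deg, ra2_deg, dec2_deg):
    """Great-circle separation in degrees between two sky positions.

    Uses the haversine form, which stays accurate for small separations.
    """
    ra1, dec1, ra2, dec2 = (math.radians(x)
                            for x in (ra1_deg, dec1_deg, ra2_deg, dec2_deg))
    sin_ddec = math.sin(0.5 * (dec2 - dec1))
    sin_dra = math.sin(0.5 * (ra2 - ra1))
    h = sin_ddec ** 2 + math.cos(dec1) * math.cos(dec2) * sin_dra ** 2
    return math.degrees(2.0 * math.asin(min(1.0, math.sqrt(h))))


# Injected position of M87 as quoted in the Fig. 13 caption.
m87_ra = hours_to_degrees(12, 30.8)
m87_dec = 12.0

# Invented recovered position, offset by 4 degrees in declination only.
error = angular_separation_deg(m87_ra, m87_dec, m87_ra, m87_dec + 4.0)
print(round(error, 3))  # 4.0
```

Applying such a function to each triple coincident event, and then taking the mean and RMS over events from one galaxy, would give summary numbers of the type reported in the text.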

VIII. SUMMARY 

LIGO and Virgo both have efficient data analysis pipelines for detecting gravitational 
waves from the inspiral of binary neutron star systems. It is likely that in the near future 
LIGO and Virgo will undertake a joint search for binary inspiral signals. Both groups 
now have confidence in each other's ability to accurately detect these signals. The results 
presented in this paper validate the performance of all the LIGO and Virgo inspiral search 
pipelines. 

A combined search for inspiral signals by LIGO and Virgo will also produce significant 
advantages. The probability for an event to be seen simultaneously by two detectors increases 
significantly when Virgo is included with LIGO in an inspiral search. Also, it becomes possible 
to locate the sky position of an inspiral event when it is observed by Virgo and by 
interferometers at the two LIGO locations. Our immediate future data analysis goals involve 
moving to the analysis of real data from the LIGO and Virgo interferometers. This will create 
additional issues that will need to be studied, such as data quality criteria and the use of 
veto channels. 